ON SUMS OF THREE SQUARES 



S.K.K. CHOI, A.V. KUMCHEV, AND R. OSBURN 

Abstract. Let r^iri) be the number of representations of a positive integer n as a sum of three squares 
of integers. We give two alternative proofs of a conjecture of Wagon concerning the asymptotic value 
of the mean square of rj,(n). 



1. Introduction 

Problems concerning sums of three squares have a rich history. It is a classical result of Gauss 
that 

has a solution in integers if and only if n is not of the form 4 a (Sk + 7) with a, k e Z. Let r 3 (n) 
be the number of representations of n as a sum of three squares (counting signs and order). It was 
conjectured by Hardy and proved by Bateman [ 1 ] that 

(1) r 3 (n) = W /2 S 3 (n), 

where the singular series S^n) is given by (fToT) with 2 = oo. 

While in principle this exact formula can be used to answer almost any question concerning 
r 3 (n), the ensuing calculations can be tricky because of the slow convergence of the singular series 
63(71). Thus, one often sidesteps (HJ) and attacks problems involving r 3 0?) directly. For example, 
concerning the mean value of r 3 (n), one can adapt the method of solution of the circle problem to 
obtain the following 

r 3 (n) ~ ^nx 3/2 . 



n<x 



3 



Moreover, such a direct approach enables us to bound the error term in this asymptotic formula. 
An application of a result of Landau [9, pp. 200-218] yields 

n<x 

for all e > 0, and subsequent improvements on the error term have been obtained by Vinogradov 
lfT9l . Chamizo and Iwaniec 0, and Heath-Brown [6|. 

In this note we consider the mean square of r 3 («). The following asymptotic formula was con- 
jectured by Wagon and proved by Crandall (see |4| or 0). 
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Theorem. Let r?,(n) be the number of representations of a positive integer n as a sum of three 
squares of integers. Then 

8tt 4 



(2) $> 3 (n) 2 



n<x 



21f(3) 



x 2 . 



Apparently, at the time they discussed this conjecture Crandall and Wagon were unaware of the 
earlier work of Miiller [fTTllTTl. He obtained a more general result which, in a special case, gives 

n<x 

where B is a constant. However, since in Muller's work B arises as a specialization of a more 
general (and more complicated) quantity, it is not immediately clear that B = ^-7r 4 /^(3). The 
purpose of this paper is to give two distinct proofs of this fact: one that evaluates B in the form 
given by Miiller and a direct proof using the Hardy-Littlewood circle method. 

2. A direct proof: the circle method 
Our first proof exploits the observation that the left side of © counts solutions of the equation 

m\ + m\ + m\ = m\ + m\ + m\ 

in integers m\, . . . , m 6 with |m ; -| < x. This is exactly the kind of problem that the circle method was 
designed for. The additional constraint m\ + m\ + m\ < x causes some technical difficulties, but 
those are minor. 

Set N = yfx and define 

f(a) = ^e(am 2 ), 

m<N 

where e(z) = e 2mz . Then for an integer n < x, the number r*(n) of representations of n as a sum of 
three squares of positive integers is 

r*(n) = / f(a) 3 e(-an) da. 



Since r 3 (n) = 8r*(n) + 0(r 2 (n)), where r 2 (n) is the number of representations of n as a sum of two 
squares, we have 

o) r 3(") 2 = 64 r * (n)2 + °( xV2+£ ) ■ 

n<x n<x 

Therefore, it suffices to evaluate the mean square of r*(n). Let 

P = N/4 and Q = N 1 ' 2 . 

We introduce the sets 

m(q,a) = {a e [Q~\l + Q~ Y ] : \qa-a\ < PN~ 2 } 

and 

1=U [J W{q,a), m = [Q~\ 1 + Q~ l ] \ SR. 

q<Q \<a<q 



We have 



(4) 



r*(n) = [ I + I \f{aYe{-an)da 



= r*(n, 2ft) + r*{n, m), say. 



We now proceed to approximate the mean square of r*(n) by that of r*(n, 2ft). By @ and 
Cauchy's inequality, 



(5) Yl r » 2 = Yl r >' ^ 2 + °i^2) 1/2 + s 2 ) 

i<x n<x 

I 1 = ^|r>,2ft)| 2 , Z 2 = ^|r>, 



n<x n<x 

where 



m)| 2 . 



By Bessel's inequality, 

(6) |2 2 l = 



2 

\3. 



f(a) e(-an)da 



< / \f(a)\*da. 



By Dirichlet's theorem of diophantine approximation, we can write any real a as a = a/q + ft, 
where 

\<q<N 2 p-\ (a,q)=l, \/3\ < P/(qN 2 ). 
When a e m, we have q > Q, and hence Weyl's inequality (see Vaughan [ 18 , Lemma 2.4]) yields 

(7) \f(a)\ « N 1+e (q~ l + AT 1 + ^AT 2 ) 1/2 « N 1+ "Q- 1 ' 2 . 
Furthermore, we have 

(8) / \f(a)fda«N 2+£ , 



because the integral on the left equals the number of solutions of 

m 2 + m\ = m 2 + m 4 

in integers mi, . . . ,m 4 < N. For each choice of m\ and m 2 , this equation has «; N £ solutions. 
Combining ©-© and replacing 6 by 6/ 3, we obtain 

(9) S 2 «/Y 4+e 2- 1 . 

Furthermore, another appeal to Bessel's inequality and appeals to © and to the trivial estimate 
\f(a)\ < N yield 



(10) l!< / \f(a)fda< / \f(a)\ 6 da « V 



Hi 



r4+£ 



We now define a function /* on 2ft by setting 

f*(a) = q~ l S(q, a)v(a - a/q) for a e 2ft(g, a) Q 2ft; 

here 

S(q,a)= (ah 2 /q) , v(j3) = ]- ^ mr m e{fim). 

l<h<q m<x 
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Our next goal is to approximate the mean square of r*(n, 9J?) by the mean square of the integral 



R*(n) = I f(a) 3 e(-an)da. 



Similarly to ©, 
(11) 



r*(n,W) 2 = J>» 2 + 0(Z 3 + (£i£ 3 ) 1/2 ), 



where 
(12) 



' [f{af-r(af]e{-an)da 

m 



< / \f{af - f{af\ 2 da, 



m 



after yet another appeal to Bessel's inequality. By [18, Theorem 4.1], when a e Tl(q, a), 

f(a) = f(a) + 0(q 1 ^). 

Thus, 

(\f(a)t + q 1+A *)da, 



\f(a) 3 -f(af\ 2 da«q l+2£ 



•~))l{q,a) 



whence 



/(or) 3 - f(a) 3 \"da « / \f(a)f da + PQ^ be N 



t\+2e 



->4+6e » T-2 



Bounding the last integral using © and substituting the ensuing estimate into (fT2"l) . we obtain 

(13) S 3 « QN 2+2e + PQ 4 N~ 2+3£ « QN 2+2£ . 
Combining ©, ©-(EH), and (O, we deduce that 

(14) ^2r*{n) 2 = ^R*(n) 2 + 0(N 4+£ Q~ 1/2 +N 3+£ Q i/2 ). 

n<x n<x 

We now proceed to evaluate the main term in (fl4l . We have 



f*(a) 3 e(-an)da = q ^S(q,aYe(-an/q) I v(fiye(-fin)dfi, 



m(q,a) 



3J!(4,0) 



SO 



R*(n) = y^AQ?, n)I(q, n), 

q<Q 



where 



A(q, n) = q 3 S (q,a) 3 e(-an/q), I{q,n) = / v(J3) 3 e(-fin) dfi. 

\<a<n J Wo) 



l<a<q 



Hence, 
(15) 



J2 R *(") 2 = ^/(") 2 S 3 («, Q? + 0((2 4 2 5 ) 1/2 + S 5 ), 



where 

•1/2 

\3 



S 3 (n,0 = ^A( ? ,n), I(n) = J 



(16) <5 3 (n,Q)=\ A(q,n), I(n) = v(fiY e(-fin) dp 



q<Q 

2 



1/2 



2 4 = 1{n)2 ( E |A ^' ' % = E ( E |A ^' n) ^ n) - /( ^ ■ 

n<x ^ g<g ' n<x ^ g<Q ' 

By E3 Theorem 2.3] and ED Theorem 4.2], 
(17) /(h) = r(3/2) 2 V^ + 0(l) = ^ V" + 0(1), A(^,n)«^ 1/2 . 

Furthermore, since A{q, n) is multiplicative in q, |[T%1 Lemma 4.7] yields 

(is) Yi |A( ^ n)| - n o + |A( ^' n)| + i a ^ 2 '")i + • • •) 

</<e p<e 

« Yl (1 + C!(p,^- 3/2 + 3c lP ~ l ) « («0 f , 

where c\ > is an absolute constant. In particular, we have 

(19) 2 4 « A^ 4+e . 

We now turn to the estimation of S 5 . By Cauchy's inequality and the second bound in (fTTb . 

£ 5 « (log J] J>(n) - /(n, <?)| 2 
Another application of Bessel's inequality gives 

V|/(«)-/(«,#<2 / \v(J3)\ 6 d/3. 

n<x Jp /4 N2 

Using [ 18 , Lemma 2.8] to estimate the last integral, we deduce that 

£ 5 « log Q (<1 2 N 4 P- 2 + 1) « N 2 Q 3+£ . 

q<Q 

Substituting this inequality and (fT9l into (fT5t . we conclude that 

(20) J2 R *W 2 = 5Z^) 2 S 3 (n, Q? + 0(N 3+£ Q 3/2 ). 

n<x n<x 

We then use (fTTl and (fTSl to replace 7(n) on the right side of (l2"0l by | VTi. We get 

J] I(n) 2 Q 3 (n, Qf = — ^»S 3 («, Q) 2 + 0(N 3+£ ) . 

n<x n<x 



Together with (fT4l) and d2"0"l) . this leads to the asymptotic formula 

16 

n<;t 
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(2D Yl r » 2 = E " S3( "' 02 + °(^ 4+£ e~ 1/2 + ^v 3+e e 3/2 ) . 



Finally, we evaluate the sum on the right side of (OTI) . On observing that 63(71, Q) is in fact a 
real number, we have 

^S 3 (n,0 2 = ^ ^2 ^2 (lil2T 3 S(q l ,ai) 3 S(q2,-a2) 3 ^2e((a 1 /q l -a2/q2)n). 

n<t qi,qi<Q i<ai<qi I<a2<cj2 n<t 

(fll,9l)=l («2,92)=1 

As the sum over n equals t + 0(1) when a,\ = a 2 and q x = q 2 and 0{q\q 2 ) otherwise, we get 

J2^(n,Q) 2 = tJ2 q~ 6 \S(q,a)\ 6 + 0(Tl), 



q<Q \<a<q 
(a,q)= 1 



where 



We find that 



3/2 



q<Q \<a<q 
(a,q)=l 



^S 3 (w, Q) 2 = B 1 t + 0(tQ- l + Q 3 ), 



with 



Bi = J2J2 q~*\s(q,a)\ 6 . 

q=\ \<a<q 
(a,q)=l 

Thus, by partial summation, 

^«S 3 («, Q) 2 = (B, /2)x 2 + 0(x 2 Q~ l + xQ 3 ) . 



Combining this asymptotic formula with (ETI) . we deduce that 

r 2 

32' 

n<x 

Recalling ©, we see that © will follow if we show that 

R 8<T(2) 
7^(3) 

This, however, follows easily from the well-known formula (see [7, §7.5]) 

{-^7 if q = 1 (mod 2), 
VS? if^ = 0(mod4), 
ifg = 2(mod4). 

Indeed, (l2"2l yields 

4 v— v -3 ,/ \ 8^(2) 



5l = a Z)«"V(9) = 



3 M 
q odd 

where the last step uses the Euler product of g(s). This completes the proof of our theorem. 
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3. Second Proof of Theorem 



Rankin |[T3l and Selberg [ 17 1 independently introduced an important method which allows one 
to study the analytic behavior of the Dirichlet series 

oo 

Ea(n) 
n s 

n=\ 

where a(n) are Fourier coefficients of a holomorphic cusp form for some congruence subgroup of 
T = SL 2 (Z). Originally the method was for holomorphic cusp forms. Zagier [ 20 1 extended the 
method to cover forms that are not cuspidal and may not decay rapidly at infinity. Miiller ifTTllT^l 
considered the case where a(n) is the Fourier coefficient of non-holomorphic cusp or non-cusp 
form of real weight with respect to a Fuchsian group of the first kind. It is this last approach we 
wish to discuss. Note that if we apply a Tauberian theorem to the above Dirichlet series, we then 
gain information on the asymptotic behavior of the partial sum 

n<x 

We now discuss Miiller's elegant work. For details regarding discontinuous groups and auto- 
morphic forms, see (HI UHl HH H31 HSl HE! - Let H = {z e C : 3(z) > 0} denote the upper half plane 
and G = S L(2, R) the special linear group of all 2 x 2 matrices with determinant 1. G acts on H by 

az + b 

gz = 



cz + d 

c d , 



a \) 

for g = I ,1 eG. We write y = y(z) = Thus we have 



y(gz) 



\cz + d\ 2 

Let dxdy denote the Lebesgue measure in the plane. Then the measure 

dxdy 



djx = 



y 2 



is invariant under the action of G on H. A discrete subgroup Y of G is called a Fuchsian group 
of the first kind if its fundamental domain T\H has finite volume. Let Y be a Fuchsian group of 
the first kind containing +/ where / is the identity matrix. Let T(Y,Xi K X) denote the space of 
(non-holomorphic) automorphic forms of real weight k, eigenvalue A = | - p 2 , %(p) > 0, and 
multiplier system^. For k 6 R, g e S L(2, R) and / : H — » C, we define the stroke operator \k by 

if\kg){z) := (^^] f(gz) 
\\cz + d\J 

where g = ^ eT. The transformation law for / 6 f(Y,x, k, A) is then 

(f\ k g)(z)=x(g)f(z) 

for all g 6 T. Automorphic forms / e f(Y,x , k, A) have a Fourier expansion at every cusp k of Y, 
namely 

A K ja(y) + ^a K>n W (smn) kj > (47T\n + (j. K \y)e((n + ft k )x), 

where fi K is the cusp parameter and a K ^ n are the Fourier coefficients of / at k. The functions W afi 
are Whittaker functions (see [11 , §3]), A K $(y) = if Hk ± and 



A M-J a ^ m+P + Koy l/2 ~ P if Mr = o, p ^ 0, 

*' oW " 1 a,, () y /2 +^, y /2 logy if ^ = o,p = o. 

An automorphic form / is called a cusp form if a K $ = b K $ = for all cusps k of T. Now consider 
the Dirichlet series 

c ft \ \ a K,n\ 

This series is absolutely convergent for 51(5) > 25l(p) and has been shown |[T2l to have meromor- 
phic continuation in the entire complex plane. In what follows, we will only be interested in the 
case / is not a cusp form. If / is not a cusp form and %(p) > 0, then S K (f, s) has a simple pole at 
s = 2%(p) with residue 

(23) &(/) = res S K (f, s) = (4n) m ^b + (k/2,p) V (p K ,(\ + 22l(p)Rol 2 , 

where K denotes a complete set of T-inequivalent cusps, <p Kil (l + 2%(p)) > and b + (|,p) > if 
p + | + | is a non-negative integer. For the definition of the functions (p K<l and b + , see Lemma 3.6 
and (69) in lfT2l. This result d2"3l and a Tauberian argument then provide the asymptotic behaviour 
of the summatory function 

^2\a K J 2 \n+ fi K \ r . 

n<x 

Precisely, we have (see [11. Theorem 2.1] or E21 Theorem 5.2]) that 

(24) V K„| 2 |n + = V resS K (f, *)— + 0(^ +2 ^(log x)*), 
z — ' z — ' s=z r + s 

n<x zeR 

where 2%{p) + r > 0, R = [±2%(p),±2itS(p),0,-r], y = (2 + 82t(p))(5 + \6%{p)Y l , and 
g =max(0, b-l);b denotes the order of the pole of S K (f, s)(r + s)~ l x r+s at s = 2%(p) (0 < b < 5). 

We now consider an application of (l2"4l) . Let Q e Z mXm be a non-singular symmetric matrix 
with even diagonal entries and q(x) = \Q\x\ = \x T Qx, x e Z m , the associated quadratic form in 
m > 3 variables. Here we assume that q(x) is positive definite. Let r(Q,n) denote the number of 
representations of n by the quadratic form Q. Now consider the theta function 

e ( z ) = ^g»feGM. 

xeZ'" 

By [ 1 1 , Lemma 6.1], the Dirichlet series associated with the automorphic form 6 Q is 

(4»r)-»"%(? + 5) 

where 

co 

n=l xeZ'"\{0) 

for 5l(^) > m/2. Using (|2"4*1) . Miiller proved the following (see ifTTl Theorem 6.1]) 

Theorem (Miiller). Let q(x) = \Q\x\ = ^x T Qx, x e Z m be a primitive positive definite quadratic 
form in m> 3 variables with integral coefficients. Then 



Y,r{Q,nf = Bx m - l +0(x im - l) ^) 



where 

m—\ 

and ficoiOo) is given by d2"3l . 

We are now in a position to prove our theorem in page 2. 

Proof. We are interested in the case q(x) = x\ + x\ + x\ and so r(Q, n) = r 3 (n) counts the number 
of representations of n as a sum of three squares. By Miiller's Theorem above, 



Y,r,{n) 2 = Bx 2 + 0(x ul9 ) 



where B is a computable constant. Specifically, we have by (l2"3l (with k = 3/2 and p = 1/4) 

An 2 

B = j—,b\3IA, l/4)^^,,(3/2)|a ( ,o| 2 , 

where K denotes a complete set of r (4)-inequivalent cusps and a L is the 0-th Fourier coefficient 
of 8q{z) at a rational cusp i. Choose K = {1, ~, |}. Then by p. 145 and (67) in flTTl. we have 

k,o| 2 = W?|G(S,)| 2 
where i = u/w, (u, w) = 1, w > 1, W L is width of the cusp t, and 

6 

r> 

w 



|G(5,)| 2 = 2-V 3 | Ve(-x 2 ) 



x=l 

As W1/4 = W\i2 = \,W\ = 4, we have |ai,ol 2 = 1, l«i/2,ol 2 = 0, and |ai/4,ol 2 = 1- An explicit 
description of the functions (fco^s) in the case T (4) is given by (see (1.17) and p. 247 in Q) 

,l-4. n ,-2n-l.l/2^-l/2)^-l) 



^, 1/ 4(5) = 2^(l-2- z T'^ 



f„\ ,„ 0-2iY1 1^2^.1/2 ^^ 1/2)^(25 1) 

<Pco,i/2(s) = (foo,i{s) = 2 (1-2 ) (1 - 2 )n — . 



Thus for s = 3/2, we have 

^,1/4(3/2) = 2- 3 (l - 2~T V 



/4 ^ / .^z,- 5 (l-2-W r(3 ^ (3) , 



Voo,i/ 2 (3/2) = ^oo,i(3/2) = 2~ 3 (1 - 2- 3 r ! (l - 2~V " P) 



r(3/2)£(3) 
Now, from p. 65 in lfT2l . we have 

Z7 + (3/4,l/4) = G; /4J/4 (3/2). 

By Lemma 3.3 and (16) in 0, 

G; /4il/4 (5) = r(5 + i/2r 1 



and so b + (3/4, 1/4) = r(2) _1 . In total, 



(3-l)r(2)V r(3/2)£(3) 



C(2) \ 

rc - - 

Sn 4 



+ 2- 5 (i-2- 3 rv /2 - 

r(3/2)^(3)y 



21^(3) 

Thus 



8tt 4 2 



21^(3) 

□ 



Remark. Miiller's Theorem can also be used to obtain the mean square value of sums of N > 
3 squares. Precisely, if r N {ri) is the number of representations of n by N > 3 squares, then a 
calculation similar to the second proof of our theorem yields (compare with Theorem 3.3 in 0) 



n<x 

where 

W N = 



1 n N aN-1) 



(N - 1)(1 - 2~ N ) T(N/2) 2 £(N) ' 
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